Abstract:
Advances in multimedia data acquisition and storage technology have led to the growth of very large multimedia databases. Analyzing this huge amount of multimedia data to discover useful knowledge is a challenging problem. This challenge has opened the opportunity for research in Multimedia Data Mining (MDM). Multimedia data mining can be defined as the process of finding interesting patterns from media data such as audio, video, image and text that are not ordinarily accessible by basic queries and associated results. The motivation for doing MDM is to use the discovered patterns to improve decision making. MDM has therefore attracted significant research efforts in developing methods and tools to organize, manage, search and perform domain-specific tasks for data from domains such as surveillance, meetings, broadcast news, sports, archives, movies, medical data, as well as personal and online media collections. This paper presents a survey on the problems and solutions in Multimedia Data Mining, approached from the following angles: feature extraction, transformation and representation techniques, data mining techniques, and current multimedia data mining systems in various application domains. We discuss the main aspects of feature extraction, transformation and representation techniques: the level of feature extraction, feature fusion, feature synchronization, feature correlation discovery and accurate representation of multimedia data. A comparison of MDM techniques with state-of-the-art video processing, audio processing and image processing techniques is also provided. Similarly, we compare MDM techniques with state-of-the-art data mining techniques involving clustering, classification, sequence pattern mining, association rule mining and visualization. We review current multimedia data mining systems in detail, grouping them according to problem formulations and approaches.
The review includes supervised and unsupervised discovery of events and actions from one or more continuous sequences. We also provide a detailed analysis of what has been achieved and what the remaining gaps are where future research efforts could be focused. We then conclude this survey with a look at open research directions.
Abstract:
In an increasingly data-driven world, large volumes of fine-grained data are infiltrating all aspects of our lives. The world of education is no exception to this phenomenon: in classrooms, we are witnessing an increasing amount of information being collected on learners and teachers. Because educational practitioners have so much contextual and practical knowledge about classroom management, we argue that data-mining workflows should be co-designed with them. This paper describes a class on Multimodal Learning Analytics taught to graduate students in education who used to be (or are planning to become) educators, teachers, or school administrators, and who have little to no technical background. The course was designed to provide novices with career-relevant hands-on activities and facilitate personal engagement with data collection and analysis. We provide examples of student-created data mining workflows and the trajectory students followed to gain a foundational understanding of data mining. Finally, we present survey data illustrating the strengths and weaknesses of the assignments and projects used in the class. We conclude with lessons learned and recommendations for implementing such a course at other institutions.
Abstract:
Currently, the internet is increasingly popular, and more people are used to sharing their feelings about various things online. Online product marketing information is also growing. How to mine the required information from this massive volume with the support of big data technology has become a major problem. Therefore, based on text mining of online product marketing information, this work discusses text preprocessing methods and the temporal convolutional network (TCN), which builds on the convolutional neural network (CNN). Moreover, on this basis, a multimodal attention mechanism (AM) and a cross-modal transformer structure are added to build an AM-based TCN (AM-TCN) model to analyze the multimodal emotion of online product marketing information. The results show that the accuracy of the AM-TCN model is 2.88% higher than that of the TCN model alone, and its F1 score is 3.47% higher. Moreover, the accuracy of AM-TCN is 1.22% higher than that of the next-best model, the recurrent multistage fusion network, and its F1 value is 0.95% higher.
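The causal, dilated convolution at the core of a TCN can be illustrated with a minimal sketch; the input sequence and kernel below are made up for illustration, and this is only the basic building block, not the AM-TCN model itself:

```python
import numpy as np

def causal_dilated_conv1d(x, w, dilation=1):
    """Causal dilated 1D convolution: the output at time t depends only on
    inputs at times <= t, with kernel taps spaced `dilation` steps apart."""
    k = len(w)
    pad = (k - 1) * dilation
    # Left-pad with zeros so the output is causal and has the same length.
    xp = np.concatenate([np.zeros(pad), x])
    y = np.zeros_like(x, dtype=float)
    for t in range(len(x)):
        # Taps at positions t, t - d, t - 2d, ... of the original signal.
        y[t] = sum(w[i] * xp[pad + t - i * dilation] for i in range(k))
    return y

x = np.arange(8, dtype=float)   # toy input sequence 0..7
w = np.array([1.0, 1.0])        # kernel of size 2
print(causal_dilated_conv1d(x, w, dilation=1))  # y[t] = x[t] + x[t-1]
print(causal_dilated_conv1d(x, w, dilation=2))  # y[t] = x[t] + x[t-2]
```

Stacking such layers with growing dilation (1, 2, 4, ...) is what gives a TCN its long effective receptive field without recurrence.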
Abstract:
A huge number of videos are posted every day on social media platforms such as Facebook and YouTube. This makes the Internet an unlimited source of information. In the coming decades, coping with such information and mining useful knowledge from it will be an increasingly difficult task. In this paper, we propose a novel methodology for multimodal sentiment analysis that harvests sentiments from Web videos using a model that draws on audio, visual and textual modalities as sources of information. We used both feature-level and decision-level fusion methods to merge affective information extracted from multiple modalities. A thorough comparison with existing works in this area is carried out throughout the paper, which demonstrates the novelty of our approach. Preliminary comparative experiments with the YouTube dataset show that the proposed multimodal system achieves an accuracy of nearly 80%, outperforming all state-of-the-art systems by more than 20%. (C) 2015 Elsevier B.V. All rights reserved.
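The two fusion strategies mentioned above can be sketched as follows; the feature vectors, per-modality scores, and weights are hypothetical placeholders for illustration, not values from the paper:

```python
import numpy as np

# Hypothetical per-modality feature vectors for one video clip.
audio_feat  = np.array([0.2, 0.7])
visual_feat = np.array([0.9, 0.1, 0.4])
text_feat   = np.array([0.5])

# Feature-level (early) fusion: concatenate the modality features into a
# single vector and feed it to one classifier.
fused = np.concatenate([audio_feat, visual_feat, text_feat])
print(fused.shape)  # (6,)

# Decision-level (late) fusion: each modality produces its own sentiment
# score, and the scores are combined, here by a weighted average.
scores  = {"audio": 0.6, "visual": 0.8, "text": 0.7}   # hypothetical
weights = {"audio": 0.2, "visual": 0.5, "text": 0.3}
decision = sum(weights[m] * scores[m] for m in scores)
print(round(decision, 2))  # 0.73
```

Early fusion lets one model learn cross-modal interactions, while late fusion keeps the modality classifiers independent and only merges their outputs.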
Abstract:
Because of the growth of the business sector dealing in the distribution of movies, software, music, and other content, a very large amount of content has accumulated. Accordingly, recommendation systems that guide users toward relevant content have become more important. In distribution businesses, accurate content recommendations are required to acquire and retain users. To establish a highly accurate recommendation system, the recommended content must be accurately classified. As classification methods, techniques such as naive Bayes, SGD (stochastic gradient descent), and SVM (support vector machine) are mainly utilized. If all of the information on recommended subjects is applied in the classification process, high accuracy can be expected, but heavy computation, long service times, and low scalability are incurred. Given this inefficiency, effective classification that uses the metadata of content is required. Metadata are expressed in the forms of the domain concept, relation, type, and attribute to allow the complicated relations between multimodal data (text, images, and video) to be processed efficiently. Most classification systems use single-modal data to express one piece of knowledge for an item in a domain. Single-modal data are limited in terms of improving classification accuracy because they do not include the useful information provided by different knowledge types. Therefore, in this paper, we propose MMCNet, a deep learning-based multimodal classification model that uses dynamic knowledge. The proposed method consists of a classification model that applies a human-learning-principle-based CNN (convolutional neural network) to multimodal data combining text and image knowledge. Using a Web robot agent, multimodal data are collected from the TMDb (The Movie Database) data set, which includes a variety of single-modal data.
In the preprocessing procedures, knowledge integration, knowledge conversion, and knowledge reduction are performed to create a quantified knowledge base. To handle text data, sentences are refined through morphological analysis and converted to numerical vectors by using word embedding. Image data are converted to numerical vectors using a vector-conversion library. The converted feature vectors are used to create multimodal learning data, and the classification model is trained on them. To reduce memory and computation demands, vector-model-based meta-knowledge is expanded through expression, conversion, alignment, inference, and deep learning. To evaluate its performance, the proposed model was compared with conventional classification methods in terms of accuracy, recall, and F1-score. According to this evaluation, the proposed classification model achieves higher accuracy, recall, and F1-score than the conventional methods. In addition, the proposed model was implemented as a deep learning-based multimodal classification system with a graphical user interface that allows users to provide feedback about the classification results by adjusting classification parameters. Through the convergence of the knowledge bases of various domains and multimodal deep learning, the dynamic knowledge that influences user preference is inferred.
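The text-to-vector step described above (word embeddings averaged into a sentence vector, then joined with an image feature vector to form one multimodal training row) might look roughly like this; the embedding table, tokens, and image vector are invented for illustration and are not taken from MMCNet:

```python
import numpy as np

# Hypothetical word-embedding table (word -> 3-dimensional vector).
embedding = {"space": np.array([0.1, 0.8, 0.0]),
             "drama": np.array([0.7, 0.1, 0.2]),
             "epic":  np.array([0.3, 0.6, 0.5])}

def text_to_vector(tokens):
    """Sentence -> numerical vector: average of its word embeddings."""
    vecs = [embedding[t] for t in tokens if t in embedding]
    return np.mean(vecs, axis=0) if vecs else np.zeros(3)

text_vec  = text_to_vector(["space", "epic"])   # refined movie overview
image_vec = np.array([0.4, 0.2, 0.9, 0.1])      # stand-in for poster features
sample = np.concatenate([text_vec, image_vec])  # one multimodal training row
print(sample.shape)  # (7,)
```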
Abstract:
Classification is a fundamental problem in machine learning and data mining. This paper applies a stochastic optimization model to classification problems. The proposed maximum entropy estimated distribution model uses a probabilistic distribution to represent the solution space and a sampling technique to explore the search space. This paper demonstrates the application of the proposed maximum entropy estimated distribution model to improve linear discriminant function and rule induction methods. In addition, this paper compares the proposed classification model with decision trees. It shows that the proposed model is preferable to the decision tree C4.5 in the following cases: i) when a prior distribution of the classification is available; ii) when no assumption is made about the underlying classification structure; and iii) when the classification problem is multimodal in nature.
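The general idea of an estimated distribution model, namely keeping a probability distribution over the solution space, sampling candidates from it, and refitting it on the best samples, can be sketched on a toy OneMax objective; the update rule, rates, and bounds below are assumptions for illustration and not the paper's maximum entropy formulation:

```python
import numpy as np

rng = np.random.default_rng(0)

def umda_onemax(n_bits=20, pop=60, elite=15, iters=30):
    """Sketch of an estimation-of-distribution search: maintain a
    probability vector p over bits, sample candidate solutions from it,
    and refit p on the best (elite) samples."""
    p = np.full(n_bits, 0.5)                         # max-entropy start
    for _ in range(iters):
        samples = rng.random((pop, n_bits)) < p      # sample solutions
        fitness = samples.sum(axis=1)                # OneMax objective
        best = samples[np.argsort(fitness)[-elite:]] # select elites
        p = 0.7 * p + 0.3 * best.mean(axis=0)        # update distribution
        p = p.clip(0.05, 0.95)                       # keep some entropy
    return samples[np.argmax(fitness)]

solution = umda_onemax()
print(solution.sum())   # close to n_bits (all ones) after convergence
```

The clipping step is one simple way to keep the distribution from collapsing prematurely, which is the role entropy plays in this family of methods.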
Abstract:
It is widely believed that bike-sharing has the potential to encourage sustainable travel by combining the flexibility of cycling with the reliability of public transport. However, there is actually little empirical evidence concerning the scale of that effect. While many models of bike-sharing travel patterns include station locations, only a few have accounted for heterogeneity in service levels. This paper aims to fill this gap by examining the case of the bike-sharing system in the city of Poznan (536,000 inhabitants). We hypothesise that a higher number of bike-sharing trips could be found in places with a higher frequency of public transport. A model based on trip data mined through a web application programming interface (over 19,240,000 GPS-recorded bicycle positions) and open public transport frequency data from the general transit feed specification is used. Regression results show that, with control variables and spatial effects included, the frequency of public transport was significantly associated with the number of bike-sharing trips. A positive effect existed for short and medium trips, whereas no relationship was found for long trips. The findings support the view that public transport frequency is a relevant factor for bike-sharing and should be taken into account in planning.
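The core regression step (trip counts regressed on transit frequency while holding a control variable fixed) can be sketched with ordinary least squares on synthetic data; the variables, coefficients, and noise level below are fabricated for illustration and omit the spatial effects used in the paper:

```python
import numpy as np

rng = np.random.default_rng(1)

# Synthetic station-level data (hypothetical, for illustration only):
# trips ~ b0 + b1 * transit_frequency + b2 * population_density + noise
n = 200
freq    = rng.uniform(2, 30, n)        # public-transport departures/hour
density = rng.uniform(1, 10, n)        # control variable
trips   = 5 + 3.0 * freq + 1.5 * density + rng.normal(0, 2, n)

X = np.column_stack([np.ones(n), freq, density])   # design matrix
beta, *_ = np.linalg.lstsq(X, trips, rcond=None)   # OLS fit
print(np.round(beta, 2))  # estimates close to the true [5.0, 3.0, 1.5]
```

Because density is included in the design matrix, the coefficient on frequency is an estimate of its association with trips net of that control.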
Abstract:
In recent years, sensor and information technologies have greatly boosted the development of wearable, portable, and medical devices. A large number of multimodal biomedical signals, such as electroencephalography (EEG), electrocardiography (ECG), electrooculogram (EOG), and electromyography (EMG), have been recorded for rehabilitation analysis, mental disorder evaluation, emotion recognition, cardiovascular disease diagnosis, etc. In these research fields, most researchers use single-modal biomedical signals to build the corresponding analysis models. However, many clinical practice tasks, such as disease diagnosis, arrhythmia detection, and sleep condition monitoring, require multiple biomedical signal modalities together to make correct diagnoses, decisions, identifications, and predictions. It is noted that learning from multimodal biomedical signals offers the possibility of capturing corresponding information across modalities and gaining an in-depth understanding of the relationships among them.
Abstract:
Mining knowledge from multimedia databases has received increasing attention recently, since huge repositories have been made available by the development of the Internet. In this article, we exploit the relations among different modalities in a multimedia database and present a framework for the general multimodal data mining problem, in which image annotation and image retrieval are considered as special cases. Specifically, the multimodal data mining problem can be formulated as a structured prediction problem, where we learn the mapping from an input to structured and interdependent output variables. In addition, in order to reduce the demanding computation, we propose a new max margin structure learning approach called the Enhanced Max Margin Learning (EMML) framework, which is much more efficient, with a much faster convergence rate, than existing max margin learning methods, as verified through empirical evaluations. Furthermore, we apply the EMML framework to develop an effective and efficient solution to the multimodal data mining problem that is highly scalable in the sense that the query response time is independent of the database scale. The EMML framework allows efficient multimodal data mining queries over very large multimedia databases and outperforms many existing multimodal data mining methods in the literature that do not scale up at all. A performance comparison with a state-of-the-art multimodal data mining method is reported for real-world image databases.
Abstract:
A multimode network consists of heterogeneous types of actors with various interactions occurring between them. Identifying communities in a multimode network can help understand the structural properties of the network, address data-shortage and imbalance problems, and assist tasks like targeted marketing and finding influential actors within or between groups. In general, a network and its group structure often evolve unevenly. In a dynamic multimode network, both group membership and interactions can evolve, posing the challenging problem of identifying these evolving communities. In this work, we address this problem by employing temporal information to analyze a multimode network. A temporally regularized framework and its convergence property are carefully studied. We show that the algorithm can be interpreted as an iterative latent semantic analysis process, which allows for extensions to handle networks with actor attributes and within-mode interactions. Experiments on both synthetic data and real-world networks demonstrate the efficacy of our approach and suggest its generality in capturing evolving groups in networks with heterogeneous entities and complex relationships.